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DETAILED ACTION 

Response to Amendment 

1 . In response to the Office Action mailed December 10, 2007, applicant submitted 
an amendment filed on April 10, 2008, in which the applicant traversed and requested 
reconsideration. 



Response to Arguments 

2. Applicant argues that the prior art cited does not teach the claim phrase 
"traversing substrings of the natural language input in an order determined by the 
weights assigned to the breakpoints. Applicant argues that the Shapiro reference 
teaches a search engine for conducting searches. Words in a search query are ordered 
according to importance, determined by the frequency of use of the word in the 
database. Such use of weighting to conduct a search has absolutely no relation to the 
Applicant's invention for segmenting compound words in an unrestricted natural 
language input. However, in response to applicant's argument that weighting to conduct 
a search has absolutely no relation to the Applicant's invention for segmenting 
compound words in an unrestricted natural language input, the fact that applicant has 
recognized another advantage which would flow naturally from following the suggestion 
of the prior art cannot be the basis for patentability when the differences would 
otherwise be obvious. See Ex parte Obiaya, 227 USPQ 58, 60 (Bd. Pat. App. & Inter. 
1985). However, the Examiner has reconsidered the Shapiro reference as a whole and 
has withdrawn the rejection; however the claims are rejected in view of new grounds of 
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rejections. McKeown discloses a method of assigning a predefined segmentation 
weight (column 6, lines 51-57) and also uses a segment significance module using a 
sequential approach for determining segment importance and coverage (column 8, lines 
24-49). 

Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

4. Claims 1-2 and 14-15 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Carcus et al. (USPN 6,035,268) hereinafter referenced as Carcus in 
view of McKeown et al. (USPN 9,473,730), hereinafter referenced as McKeown and in 
further view of Matsubayashi et al. (USPN 6,473,754), hereinafter referenced as 
Matsubayashi. 

Regarding claim 1, Carcus discloses a method for segmenting compound words 
in an unrestricted natural-language input, the method comprising: 

receiving a natural-language input (natural language) consisting of a plurality of 
characters (receiving stream of input text; compound word; column 1, line 19 -column 
7, line 18); 
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constructing a set of probabilistic breakpoints in the natural-language input based 
on probabilistic breakpoint analysis (statistical analysis; column 1, line 19 -column 7, 
line 18); 

identifying a plurality of linkable components by traversal of substrings of the 
natural-language (natural language) input delimited by the set of probabilistic 
breakpoints (word breaks) wherein a linkable component (link) is identified by locating 
the component in a lexicon (lexicon; column 1, line 19 -column 7, line 18); and 

returning a segmented string consisting of a plurality of linkable components 
spanning (spanning) the natural-language input, wherein the segmented string is 
interpreted as a compound word (compound word; column 1, line 19 -column 7, line 
18), but does not specifically teach assigning weights to the breakpoints in the natural- 
language input and traversing substrings of the natural-language input in an order 
determined by the weights assigned to the breakpoints and combining a probability. 

McKeown discloses a method wherein assigning weights to the breakpoints 
(assigns a predefined segmentation weight; column 6, lines 51-57) in the natural- 
language input (natural language; column 1, lines 38-50) and traversing substrings of 
the natural-language input in an order determined by the weights assigned to the 
breakpoints (sequential approach for determining segment importance and coverage; 
column 8, lines 24-49), to efficiently and accurately identify segment. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Carcus' method wherein it is described as 
above, to provide a method that segments information according to the segment 
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function and the importance, for efficient and accurate segmentation (column 1, lines 
29-35). 

Carcus in view McKeown discloses a method of segmenting compound words, 
but does not specifically teach combining a probability that characters preceding each 
breakpoint end a word and a probability that characters following the breakpoint start a 
word to assign weights to the breakpoints in the natural-language input. 

Matsubayashi discloses a method of combining a probability that characters 
preceding each breakpoint (probability of division) end a word (tail-position) and a 
probability that characters following the breakpoint start a word (head-position) to assign 
weights to the breakpoints in the natural-language input (column 3, line 41 - column 4, 
line 41 and column 15, lines 47-56 and column 16, line 65 - column 17, Iine10 with 
column 20, lines 27-53), to extract characteristic string with less erroneous division. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Carcus in view of McKeown's method wherein it 
combines a probability, as taught by Matsubayashi, to extract characteristic string with 
less erroneous division and this less search noise to realize searching of a relevant 
document or documents with less shift from the main concept of a seed document 
(column 5, lines 26-35). 

Regarding claim 2, Carcus discloses a method further including the step of 
analyzing a chart of the linkable components in the case that the segmented string 
cannot be constructed and returning an unsegmented string interpretable as a partial 
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analysis of a compound word (removed from the word breaker; column 1, line 19 - 
column 7, line 18 and column 36-54). 

Regarding claim 14, it is interpreted and rejected for the same reasons as set for 
h in claim 1 . In addition, Carcus discloses a method wherein assigning weights 
comprises combining weights of contexts of one length that precede a breakpoint and of 
contexts of a different length that follow the breakpoint (inherent in parsing; column 1 , 
paragraphs 0010-0011 and column 2, paragraph 0023). 

Regarding claim 15, it is interpreted and rejected for the same reasons as set 
forth in claim 1 . In addition, Carcus discloses a method wherein assigning weights 
comprises weighting weights of a plurality of context of different lengths that precede 
and follow a breakpoint (inherent in parsing; column 1 , paragraphs 001 0-001 1 and 
column 2, paragraph 0023). 

Allowable Subject Matter 

5. Claims 3-9 and 1 1-13 are allowed. 

The following is a statement of reasons for allowance: 
As for independent claim 3, it recites an apparatus for segmenting compound 
words in a natural-language input. Prior art such as Franz show a similar configuration 
but fails to teach the recited configuration wherein a probabilistic breakpoint analyzer is 
coupled to the startpoint probability matrix, the endpoint probability matrix and the 
natural-language input, the probabilistic breakpoint analyzer being operative to generate 
a breakpoint-annotated input from the natural-language input. 
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Dependent claims 4-9 are allowed because they further limit their parent claims. 
As for independent claim 11 , it recites an apparatus for segmenting compound 
words in an unrestricted natural-language input. Prior art such as Carcus, Shapiro and 
Matsubayashi show a similar method but fails to teach the recited method of combining 
weights of trigraph context that precede and follow each breakpoint to assign a weight 
to the breakpoint in the natural-language input in combination with the other limitations. 

As for independent claim 12, it recites an apparatus for segmenting compound 
words in an unrestricted natural-language input. Prior art such as Carcus, Shapiro and 
Matsubayashi show a similar method but fails to teach the recited method of combining 
weights of bigraph context that precede and follow each breakpoint to assign a weight 
to the breakpoint in the natural-language input in combination with the other limitations. 

As for independent claim 13, it recites an apparatus for segmenting compound 
words in an unrestricted natural-language input. Prior art such as Carcus, Shapiro and 
Matsubayashi show a similar method but fails to teach the recited method of combining 
weights of tetragraph context that precede and follow each breakpoint to assign a 
weight to the breakpoint in the natural-language input in combination with the other 
limitations. 

Conclusion 

6. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to JAKIEDA R. JACKSON whose telephone number is 
(571)272-7619. The examiner can normally be reached on Monday-Friday from 
5:30am-2:00pm. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on 571-272-7843. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
JRJ 

June 23, 2008 
/David R Hudspeth/ 

Supervisory Patent Examiner, Art Unit 2626 



